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Abstract 



Let {Gt}t>o be the random graph process (Go is edgeless and Gt is obtained by adding a 
uniformly distributed new edge to Gt-i), and let Tk denote the minimum time t such that the 
fc-core of Gt (its unique maximal subgraph with minimum degree at least fc) is nonempty. For 
any fixed fc > 3 the fc-core is known to emerge via a discontinuous phase transition, where at time 
t = Tfc its size jumps from to linear in the number of vertices with high probability. It is believed 
that for any fc > 3 the core is Hamiltonian upon creation w.h.p., and Bollobas, Cooper, Fenner and 
Frieze further conjectured that it in fact admits |_(/c — 1) /2J edge-disjoint Hamilton cycles. However, 
even the asymptotic threshold for Hamiltonicity of the fc-core in Q(n,p) was unknown for any fc. 
f-n ■ We show here that for any fixed fc > 15 the fc-core of Gt is w.h.p. Hamiltonian for all t > r^, i.e., 

immediately as the fc-core appears and indefinitely afterwards. Moreover, we prove that for large 
enough fixed fc the fc-core contains [(fc — 3) / 2J edge-disjoint Hamilton cycles w.h.p. for all t > Tk. 

1 Introduction 

The fundamental problem of establishing the threshold for Hamiltonicity in the Erdos-Renyi random 
(T) \ graph traces back to the original pioneering works of Erdos and Renyi introducing this model (see [13]). 

^ | Following breakthrough papers by Posa [29] and Korshunov [20], this question has been settled by 

Bollobas [4| and by Komlos and Szemeredi [19] ; who proved that if p = (logn + log log n + u) n )/n 
for any diverging increasing sequence u n then the probability that a random graph G drawn from 
the probability space G(n,p) of binomial random graphs is Hamiltonian tends to 1 as n — > oo. This 
threshold precisely coincides with the one for having minimum degree 2 in G(n,p), and indeed a refined 
hitting time version of this result was proved by Bollobas [4 J and by Ajtai, Komlos and Szemeredi [1]. 
They showed that in a typical random graph process {Gt}t>o on n vertices (Go is edgeless and Gt is 
obtained by adding a uniformly distributed new edge to Gt—i), a Hamilton cycle appears exactly at 
the moment t when the minimum degree of Gt becomes 2 (an obvious prerequisite for Hamiltonicity). 

With vertices of degree less than 2 being the primary obstacle for Hamiltonicity in the random 
graph, it was natural to instead consider the subgraph formed by (repeatedly) excluding such vertices. 
The k-core of a graph G, denoted here by G^ k \ is the (unique) maximal subgraph of G of minimum 
degree at least k (equivalently, it is the result of repeatedly deleting vertices of degree less than k). 
This concept was introduced in 1984 by Bollobas [4], who showed that for fixed k > 3 the fc-core is 
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typically nonempty and ^-connected already at p = C(k)/n. Luczak [23] established in 1987 that 
the threshold for the /c-core to contain [k/2\ edge-disjoint Hamilton cycles, plus a disjoint perfect 
matching for odd k, is when p = (^rjlogn + /cloglogn)/n. In particular, the 2-core of Q(n,p) is 



sequence u n , preceding the Hamiltonicity of the entire graph roughly by a factor of 3. However, for 
any fixed k > 3, the threshold for the /c-core to contain a (single) Hamilton cycle remained open. 

The best upper bound to date on this threshold is (2 + Ofc(l))(& + l) 3 /ra by Bollobas, Cooper, 
Fenner and Frieze [6] via their study of the related model Q(n,m,k), a graph uniformly chosen out 
of all those on n labeled vertices with m edges and minimum degree at least k. It is well-known 
(and easily seen) that, conditioned on its number of vertices N and edges M, the /c-core of Q(n,p) is 
distributed as Q(N, M, k), hence the results on the latter model for a suitable M = M(N) carry to 
for G ~ Q(n,p). It was shown in [6], following a previous paper of Bollobas, Fenner and Frieze [7j, that 
for any fixed k > 3 the graph G ~ G(n, m, k) w.h.p. contains [_(k — l)/2j edge-disjoint Hamilton cycles 
(plus a disjoint perfect matching when k is even) as long as m/n > (k + l) 3 (see the recent work [14j 
for an improvement in the special case k = 3). This can then be translated to G ~ G(n,p) to imply 
that, for any fixed k > 3, its /c-core is Hamiltonian w.h.p. provided that p > (2 + Ok(l))(k + l) 3 /n. 

An obvious lower bound is the first appearance of the /c-core, which on its own involves a remarkable 
first-order phase transition along the evolution of the random graph: Letting k > 3 and defining 



to be the first time at which the random graph process gives rise to a nonempty /c-core, Luczak p4,25 
showed that \G)- k \ > n/5000 w.h.p. That is, in a typical random graph process, the /c-core remains 
empty until the addition of a single edge creates a chain reaction culminating in a linear-sized core. 
This fascinating phenomenon triggered a long line of works (see, e.g., [9T[l2 ^ [T6 t [T7 1[26ti28t l30 y 31 j ) . Most 
notably, the tour-de-force by Pittel, Spencer and Wormald [28] determined the asymptotic threshold 
for its emergence to be p = Ck/n for Ck = infA>o A/P(Poisson(A) > k — 1) = k + V k log k + 0(\/fc), as 
well as characterized various properties of the core upon its creation, e.g., its size, edge density, etc. 

Main results. The threshold for the Hamiltonicity of the /c-core of Q(n,p) is roughly between k/n 
and 2/c 3 /n, as discussed above. Comparing these, it was long believed that the lower bound should be 
asymptotically tight, and indeed Bollobas, Cooper, Fenner and Frieze [6] conjectured (see also [30]) that 
their result could be pushed down to about k/n, and in particular, the /c-core should be Hamiltonian 
w.h.p. for p > Ck/n. Furthermore, since the fc-core is known to enjoy excellent expansion properties 
(being a random graph with a particularly nice degree distribution, namely, Poisson conditioned on 
being at least k) immediately upon its creation, one could expect the hitting time version of this result 
to also be true (see, e.g., [181 Chapter 5] for related questions). That is, w.h.p. the fe-core should be 
Hamiltonian since the very moment of its birth in the random graph process, i.e., at time Tfe. Here we 
are able to show this stronger statement for any k > 15, as given by the following theorem. 

Theorem 1. Let {Gt} be the random graph process on n vertices. Fix k > 15 and let Tk be the hitting 
time for the appearance of a k-core in Gt- Then w.h.p. the k-core of Gt is Hamiltonian for all t > Tk- 

Note that the Hamiltonicity of the /c-core is a non-monotone property, and as such, guaranteeing 

(k) 

that G\ is Hamiltonian for all t > Tk is nontrivial even given that it is Hamiltonian at time t = Tk- 
1 An event is said to occur with high probability (or w.h.p. for brevity) if its probability tends to 1 as n — >• oo. 




increasing 



Tk = mm 



{t : G\ k) + 0} 



(1.1) 
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This is due to the fact that the k-core dynamically grows with the addition of edges along the random 
graph process. With each new vertex that joins the fc-core, we must immediately find a new cycle 
going through this vertex as well as all the other vertices of the core. This dynamic growth of the 
/c-core poses additional challenges for the proofs, as we will now explain. 

A common recipe for establishing Hamiltonicity in random graph models relies on sprinkling new 
edges, through which longer and longer paths are formed in the graph, eventually culminating in 
a Hamilton path that is followed by a Hamilton cycle. To show that the graph G at time T\ is 
Hamiltonian via this framework, one can examine the graph at time Tq = T\ — cn, call it H, and 
show that it enjoys the following property: Out of all edges missing from H, a constant proportion are 
"good" in the sense that adding any single good edge will support the above extension of the maximum 
path length. This is typically established using strong expansion properties of H, and in particular, 
this property is retained when additional edges are added to the graph. As such, when adding cn 
edges (for an appropriate constant c > 0) along the interval (Tq,Ti), one can collect (w.h.p.) up to n 
new good edges, leading to a Hamilton cycle. (A version of this framework also exists for a choice of 
To = T\ — o(n), provided that the initial graph H satisfies additional more delicate properties.) 

In our setting, the above framework faces a major obstacle due to the dynamical nature of the 
/c-core, already present if one merely wishes to show the weaker asymptotic version of Theorem Q] (i.e., 
that for any fixed e > the fc-core of G ~ Q(n,p) is w.h.p. Hamiltonian at p = {ct + £)/n). Indeed, 
the very same sprinkling process that repeatedly extends our longest path simultaneously enlarges the 
k-core and might destroy the aforementioned properties of H = Gt and foil the entire procedure. 

Even more challenging is the obstacle in the way of a hitting time version (Hamiltonicity of G^ at 
t = r fc)> due to the first-order phase transition by which the /c-core emerges. Already at time — 1 the 
£;-core is empty, thus inspecting H = Gt for To < T\ = would not reveal which subgraph of Gt x 
will host the future core (and the sought properties will clearly not hold for every possible subgraph). 

The novelty in this work is a different setup for sprinkling, hinging on conditioning on the future 
at time To. Roughly put, at time To we condition on the subset of vertices V that will host the fc-core 
at time T\, as well as on some of the edges along the interval (Tq,T\), in a careful manner such that: 

(i) A constant fraction of the edges along the interval (T),Ti) remains unrevealed. 

(ii) The unrevealed edges are uniformly distributed over a (linear) subset of the edges that, all 
throughout the sprinkling process, contains most of the good edges. 

Observe that one can establish various properties of the induced graph of H on V (e.g., expansion) 
and these would now be retained throughout the sprinkling (as V remains static in the new setup). 
Properties (JTJ> and (|n|) then enable one to appeal to the standard sprinkling framework for Hamiltonicity. 
In our context, combining this method with the classical rotation- extension technique of Posa [29! 
reduces Hamiltonicity to the task of verifying certain properties of the induced graph on V' at time To 
(namely, suitable expansion and the existence of a sub-linear number of cycles covering all vertices). 
The latter task is where we require that k > 15 for the arguments to hold (and be reasonably short), 
and it is plausible that a more detailed analysis of this technical part would relax this restriction. 

Engineering the above setup to begin with, however, is highly nontrivial: Indeed, the identity of 
the vertex set of the future core V can give away a great deal of information on the upcoming edges 
until time Tl, easily biasing them away from being uniform on a given set of edges. Moreover, the set 
of good edges dynamically changes, and depends on the structure of the A:-core in a complicated way... 
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The key step is to examine the set V, conditioned to host the k-core at time T±, and partition it 
into Bk C V and Ck = V \ B^, based on the graph H at time Tq = T\ — o(n) in the following manner: 
The set Bk will consist of every v € V such that v has at least k neighbors in V' already at time To. 
We then reveal all edges along the interval (Tq,Ti) except those with both endpoints in Bk (giving 
away only the total number of such edges encountered along this interval) with the intuition being: 

(i) Every v £ Bk already guaranteed its inclusion in the fc-core, thus the position of the unrevealed 
edges cannot interfere with our conditioning on V' , and consequently these edges are uniform. 

(ii) As the fc-core is linear and T\ — Tq = o(n), we have \Bk\ = (1 — o(l))|V'| and so almost all the 
good edges in our sprinkling process are going to have both of their endpoints lie within Bk ■ 

Of course, turning this intuition into a rigorous argument requires a delicate analysis, working around 
several complications that conditioning on the future V entails. The full details are given in Section [2j 

Having settled the issue of Hamiltonicity in fc-cores for k > 15, we turn our attention to packing 
Hamilton cycles, addressing the conjecture of Bollobas, Cooper, Fenner and Frieze [6J that the k-core 
should give rise to \ {k — l)/2j edge-disjoint Hamilton cycles (plus a perfect matching when k is even). 

As noted in [BJ, packing \_{k — l)/2j edge-disjoint Hamilton cycles in the fc-core of Q(n,p) for 
p = 0(1/ n) would be best possible w.h.p. The obstruction, as in the previous work of Luczak |23j, 
is the presence of a so-called k-spider, a vertex of degree k + 1 whose neighbors all have degree k. 
(Whence, with k being even, packing k/2 Hamilton cycles would clearly use all the edges incident to 
these degree-/c vertices, and in the process exhaust all k + 1 edges incident to the root, impossible.) 
As mentioned above, packing this many Hamilton cycles (possibly plus a matching) in the /c-core was 
shown in [UJ to be possible roughly at p = 2/c 3 /n, compared with its emergence threshold of p ~ k/n. 

Here we nearly settle this conjecture, albeit for a sufficiently large k, by extracting [(k — 3)/2j 
edge-disjoint Hamilton cycles (one cycle short of the target number) as soon as the /c-core is born. 
Similarly to the result in Theorem [H this packing is then maintained indefinitely afterwards w.h.p. 

Theorem 2. Let {Gt} be the random graph process on n vertices. Let denote the hitting time for 
the appearance of a k-core in Gt for some large enough fixed k. Then w.h.p. the k-core of Gt contains 
L(/c — 3)/2j edge-disjoint Hamilton cycles for all t > r^. 

The proof of the above theorem follows the same framework of the proof of Theorem [TJ with the 
main difference being the properties we require from the induced subgraph on the future /c-core V 
before commencing the sprinkling process. Instead of seeking a single 2-factor in this subgraph as in 
the proof of Theorem [Tj the new goal becomes finding a [(k — 3)/2j edge-disjoint 2-factors, which 
we can then convert to Hamilton cycles one at a time. To this end we rely on the recent results of 
Pralat, Verstraete and Wormald |30| and of Chan and Molloy [XOj , which showed the existence of 
(k — 2)-factors and (k — l)-factors in the /c-core, respectively, for any sufficiently large k. 

Organization and notation. In £}2] we prove Theorem Q] modulo two technical lemmas (Lemmas 12.31 
and 12.41 whose proofs appear in $3]) on typical properties of /c-cores in the critical and the supercritical 
regimes, resp. The proof of Theorem [2] appears in 21 and the final section £}5] contains open problems. 

Throughout the paper, all logarithms are in the natural basis. Our graph theoretic notation is 
standard: For a given graph G and subsets X, Y of its vertices, ec{X) and ec(X, Y) denote the 
numbers of edges of G spanned by X and between X and Y, respectively. We let Ng(X) denote the 
set of neighbors of X in G, and occasionally refer to Ng(X) \ X as the external neighbors of X in G. 
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2 Hitting time for Hamiltonicity of the core 



In this section we prove Theorem [T] modulo two technical lemmas whose proofs will be postponed to 
Section [3j The task of establishing that the fc-core of almost every random graph process is Hamiltonian 
as soon as it emerges and indefinitely afterwards is divided into two separate regimes: an 0(n)-interval 
of the random graph process surrounding the abrupt emergence of the k-coie, which we refer to as the 
critical regime, and the 0(nlogn)-interval that immediately follows it, referred to as the supercritical 
regime. Our aim in the critical regime is to establish Hamiltonicity w.h.p. under the assumption that 
the £>core already exists, whereas in the supercritical regime we aim to maintain the Hamiltonicity 
property even as the fc-core continues to grow. Achieving these goals are the following Theorems 12.11 
and 12.21 which address the critical and supercritical regimes, respectively. 

Theorem 2.1. Fix k > 15 and let G ~ G(n,m) for some m satisfying max{fc/2,10}n <m< ^kn. 
Then with probability 1 — o(n~ 3 ) either is empty or it is Hamiltonian. 

Theorem 2.2. Fix k > 15, let H ~ Q(n,m) for \{7k — l)n < m < ralogn, and let G be the graph 
obtained by adding [n/8\ uniformly distributed new edges to H. Then the probability that is 
Hamiltonian, and at the same time G^ k ' is connected but not Hamiltonian, is at most o(n~ 3 ). 

Using well-known properties of the k-core, Theorem [T] is obtained as an immediate corollary of 
Theorems 12.11 and 12.21 Indeed, taking a union bound over the interval I\ = [max{fe/2, 10}n, |/cn] we 
can extend the conclusion of Theorem 12. II to hold with probability 1 — o(n~ 2 ) simultaneously for all Gt 
with t G I\. Thus, provided that r^, falls within this interval, this implies Hamiltonicity upon creation 
and up to t < ^kn. The explicit threshold in Q{n,p) for the sudden emergence of the fc-core for k > 3 
is known (see, e.g., PHI2SH23) to be Cfc/n for c k = k + ^k log k + 0(Vk). Specifically in our context, 
one can verify that C15 ~ 20.98 and max{20, k} < c k < 2k holds for any k > 15. The corresponding 
threshold for the emergence of the fc-core along the random graph process is concentrated around 
Cfcn/2, and moreover it is known ( |17} Theorem 1.4]) that (r^ — Cfcn/2)/y / n converges in distribution 
to a Gaussian N(0,cr|) for some (explicit) a\ > 0. In particular, G [max{/j/2, 10}n, kn] except with 
probability exponentially small in n, and altogether Gt is w.h.p. Hamiltonian for all r/% < t < ^kn. 

In order to maintain this property throughout the evolution of the random graph process (in which 
the /c-core grows and consequently might not remain Hamiltonian), it suffices to do so up to t = ralogn, 
as it is well-known that Q(n,m) for m = i^nlogn, for any constant e > 0, is w.h.p. fc-connected 
and Hamiltonian (see, e.g., [5,18] and the references therein). A union bound over the time interval 
I2 = [■gkn — n/6, nlogra] extends the conclusion of Theorem 12.21 to hold with probability 1 — n~ 2+ °^ 
simultaneously for all Gt with t G 1%. With this probability we have that Hamiltonicity at time t G I2 
is carried to time t + n/8, unless the fc-core at this later time happens to be not connected. It was 
shown by Luczak ( [251 Theorem 5'], see also [23]) that w.h.p. throughout the entire random graph 
process {Gt}, whenever the fc-core is nonempty it is necessarily fe-connected and contains at least 
ra/5000 vertices. It therefore follows that Theorem 12.21 can extend the Hamiltonicity from an interval 
[a, a + n/8] for some a > ^kn — n/6 to hold w.h.p. for all t > a. As < kn w.h.p., with the above 
conclusion from Theorem 12. II we can take a = \^kn — n/6] and thus Gt is w.h.p. Hamiltonian for all 
t > Tfc, concluding the proof of Theorem [TJ 

In the remainder of this section we prove Theorems 12.11 and 12.21 modulo Lemmas 12.31 and 12.41 which 
are stated next and whose proofs appear in Sections 13.11 and 13.21 respectively. 
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Lemma 2.3. Fix k > 15 and let G ~ Q(n,m) for m = cn/2 such that max{£;,20} < c < |/c. With 
probability 1 — o(n~ 3 ) the following holds for any G = (V',E') C G of minimum degree 5(G') > k 
such that \V'\ > |ra and the induced subgraph of G on V' contains at most \E'\ + n/ log log re edges: 

(i) the graph G' is connected, 

(ii) every X C V of size \X\ < n/c 2 satisfies \N G >(X) \X\ > 2\X\, and 
(Hi) there exists a factor in G' consisting of a path P and at most n/y/logn cycles. 

Lemma 2.4. Fix k > 15, let H ~ Q(n,m) for ^(7k — l)n < m < nlogn and let G be obtained by 
adding n/8 uniformly chosen new edges to H. Let G' = (V',E r ) C G be the graph where V = V(G^) 
and uv E E' for u,v E V if either uv E E(H) or alternatively uv E E(G) \ E(H) and at least one of 
u,v belongs to V \ V(H^). Then with probability 1 - o(r?T 3 ) we have \ V'\ > \V(H^)\ > 0.999n and 
every subset X C V of size \X\ < n/5 satisfies \Nq>(X) \X\> 2\X\. 

Proof of Theorem 12. 11 Given two graphs H C G on the same vertex set V, define the partition of 
the vertex set of G^ k \ the fc-core of G, into Bk = Bk(G,H) and Ck = Ck(G,H) as follows: 

B k = {ueV(G^) : \N H (u)nV(G^)\ > k] , C k = V(G^)\B k . (2.1) 

That is, B k consists of all vertices of G^ which possess at least k neighbors within V(G^) already 
in the subgraph H. It is easily seen that Ck is empty iff V(H^) = V(G^), and that every vertex of 
Ck must have an edge incident to it in E(G) \ E(H). We will be interested in settings where G^ is 
nonempty and \E(G) \ E(H)\ = o(n), whence Bk will comprise the bulk of the k-core of G. 

Let S = (ei, . . . , e m ) be a uniformly chosen ordered subset of m edges out of the Q) possible ones 
on the vertex set V = [n], and let G and H be the graphs on the vertex set V with edge sets 



E(G) = {ei, . . . ,e m } , 

E(H) = {e±, . . . ,e m '} where m=m 



n 



log log n 



As G ~ G(n,m), if G^ is empty then there is nothing left to prove. We therefore assume otherwise 
and aim to establish that G^ is Hamiltonian with probability 1 — o(n -3 ). The key to the proof will 
be to examine the random variables Bk(G,H) and Ck(G,H) as defined above. Already we see that 
\Ck\ < 2(m — m!) = 2n/loglogn, which would make it negligible compared to the linear size of G^. 

Denote the indices of the subset of the final m — m' edges having both endpoints in Bk by 

J = jm' < j < m : ej = (u,v) for some u,v E Bk} , (2-2) 

and let J = \J^\. We claim that upon conditioning on Bk, Ck (such that BkUCk / by our assumption) 
as well as J and all the edges S* = {ej : j J}, the remaining J edges of S are uniformly distributed 
over all edges missing from Bk (that is, edges with both endpoints in Bk that did not appear among 
e\, . . . ,e m i). To establish this it suffices to show that adding any possible set of J new edges with 
endpoints inside Bk to S* would result in a graph such that Bk U Ck is consistent with our conditioning 
(since each such extension would have the same probability, and the partition V' = Bk U Ck only 
depends on the graph H and the vertex set V). 

Indeed, letting F = (V,S*) we argue that V{F^) = V(G^). The fact that V(F^) C V(G^) 
follows trivially from having S* C S. At the same time, V(G^) C V(F^) since every u E C k (G,H) 
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has Np(u) = Ng(u) (which contains, in particular, at least k vertices from V(G^)) whereas every 
u e B k (G, H) has N H (u) C N F (u) and \N H (u) n V{G^)\ > k by the definition of B k {G,H). It is 
now clear for any subset S of edges whose endpoints all lie within B k (G,H) (or even within V(GW) 
for that matter), the vertex set of the &;-core of F' = (V, S* U S) remains equal to V(G^). That is, 
every extension of S* via J new internal edges of B k is consistent with our conditioning, as claimed. 

A delicate point worth noting is that the above statement would be false if we were to directly 
take m = r k , the hitting time for the emergence of the fe-core. In that case, it could occur that adding 
certain edges between the vertices of B k to the graph F = (V, S*) would induce a different fc-core to 
appear at some earlier time j < m, thus violating our conditioning. This issue disappears once we fix 
m deterministically, thereby guaranteeing that no edges within B k would be forbidden. 

The next crucial step is controlling the random variable J, which counts the number of edges within 
B k between times m'+l, . . . ,m. Our goal is to show that J/(m — m') is uniformly bounded from below, 
that is, that a constant fraction of the edges e m / +1 ... ,e m are internal edges of B k . Intuitively, one 
could hope that the distribution of the edges in E(G) \ E(H) would be close to uniform. In that case, 
since the graphs G, H under consideration are such that B k is of linear size, standard concentration 
arguments would yield the sought estimate except with a probability that is exponentially small in n. 
While such an argument would be valid for a fixed set of vertices in H, unfortunately the set B k is 
random and does depend on e m > + i, . . . , e m . 

For instance, consider the situation where conditioned on H and then on the partition B k U C k we 
have that C k = 2(to — m'). This occurs when the edges e m '+i, . . . ,e m form a matching on the vertices 
of C k (so as to accommodate the degree constraints of all 2 (to — to') of its vertices). Rare as this event 
may be, it demonstrates the possibility that J = (even deterministically) in this conditional space. 
Another example of the delicate dependence between J and B k is the following: Suppose that instead 
of B k we would work with its variant B' k , consisting of all vertices of V(G^) whose set of neighbors 
within V{G^) is the same in G as it is in H. Then B' k C B k and these two sets differ by at most 
to — to' = o(n) vertices, and yet the variable J' corresponding to B' k satisfies J' = by definition. 

As the next claim shows, it is possible to control the random variable J despite the dependence 
between the B k and E(G) \ E(H). 

Claim 2.5. Let J = \J\ for J as defined in ([22]). Then 

P ( J > n/(100 log log n)) = l- 0(n~ 9 ) . 

Proof. The idea underlying this approach would be to approximate B k via another random set whose 
identity is completely determined by G (as opposed to the combination of G and H). 

Let A\ = A\{G) denote the event that there is a subset S of s = [f^J vertices of G which spans 
at most r = m/50 edges. Taking p = c/n for c = 2m/n we appeal to the standard correspondence 
between Q(n,m) and G(n,p), and since r ~ jqS 2 p we get that 

mo < (r+i)(;)(®y ( i- P )G)- < ( ( i +0 (i))f ($) r/ v- /2 y 

< ((|e + o(l)) (Sef^e-^y = ((fe + o(l)) {Se~^) S < (3/4)% 

where the last inequality uses our hypothesis that c > 20. It follows that the probability of encountering 
Ai is at most 0(exp(— oqw)) for some absolute oq > 0. 
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Let X\ = Xi(G) denote the number of vertices of degree precisely I in for a fixed integer 
I > k. It is known ([9j Corollaries 2 and 3]) that the fraction of vertices of degree I within the fe-core 
converges to P(Z = I) where Z is a Poisson random variable with mean u for some explicitly given 
fjL = u(k) > k. More precisely, for any fixed e > 0, 



>en )=0(e- nai ) , (2.3) 



e V 

A; n 

where a\ = a\{k) > is fixed. One should note that [9] offers (sharper) estimates only for the 
supercritical range c> c k . However, one may extend these to c ~ c k , given that the k-core is nonempty 
at that point, as follows. Fixing some arbitrarily small 5 > 0, at d = c k + 5 these bounds yield 
concentration for X[ within a window of width en (for any fixed e > 0) except with an exponentially 
small probability. Next, appealing to the known estimates on |V(G^)| and \E(G^)\ throughout 
the critical window for the emergence of the /c-core (see \17\ Theorem 1.3]), it is known that the 
difference in these two random variables between times cn/2 and c'n/2 is at most some en except 
with probability 0(e~" ai ) for some a\ > fixed, where e can be made arbitrarily small via selecting 
a suitably small 5. Consequently, the value of X\ cannot change by more than 2en along this interval 
(accounting for lost edges as well as lost vertices via the change in V(G^ k ') and E(G^ k >), respectively), 
thus establishing (|2,3p with a window of 3en. To complement this bound, observe that for fixed I > 15, 

e-y < e -y _ e-fr-')(l + ^)* < 1 1 
U - ^(l/e)' ~ V2nl ~ 8' 

where the first inequality above followed from Stirling's formula and the last one holds for I > 15. 
Altogether, we can infer that the number of vertices whose degree belongs to {k, k + 1, k + 2} satisfies 

P (X k + X k+1 + X k+2 < In) = 1 - O {e~^ n ) . 

At the same time, known properties of the A;-core upon its emergence (recall that G^ is nonempty 
by assumption), namely the explicit formula for its typical initial size as well its concentration around 
its mean (see, e.g., |28} Theorem 3]), imply that for some absolute a>2 > and any fixed k > 15, 

-n a 2\ 



(|^(G (fc) )| > fn) = l-O 



Ki=loglogn, k 2 = 10 —. ; , P = - 2 , (2.4) 



Let A 2 = A 2 {G) denote the event that X k + X k+ i + X k+2 > |n or \V(G^)\ < |n. The last two 
estimates then imply that P(^2) < 0(e~ na2 ), and on the event A 2 we see that at least |n vertices 
have degree at least k + 3 in G^ k \ 

A final ingredient we need is some control over vertices of large degree in G. Set 

log n n 
log log n ' ^ log 2 n 

and let A3 = A%(G) denote the event that either A(G) > k 2 , where A(G) denotes the maximum 
degree of G, or there are p vertices of G whose degree exceeds k\. It is well-known that whenever 
m/n is uniformly bounded from above, the maximum degree in Q(n,m) is at most (1 + o(l)) Io ^g n 
w.h.p. (see, e.g., [5j §3]), and moreover, the probability this maximum degree would exceed k 2 as given 
in (|2.4|) is at most 0(n~ 9 ). Turning to the probability that a given v £ V has degree at least n\ in G, 
again working with the corresponding G(n,p = c/n) model with c = 2m/n we find it to be 

P(Bin(n- l,p) > «i) = 0(c Ki /k 1 \) = e -(i-°(i)W i°s^i < e ~^i = (\ og n)- 5 , 
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where the strict inequality holds for large n. We will now argue that the probability of encountering p 
such vertices (recall that p = n l ~°^) would be exp(— n 1-0 ^ 1 )). Indeed, the probability of encountering 
a set S of p vertices such that the induced subgraph on S contains more than 10/? edges is at most 



At the same time, in a set S of p vertices whose degrees are all at least k\ and where additionally 
the induced subgraph on S has at most 10/3 edges we must have at least p(K\ — 20) edges in the cut 
between S and V(G) \ S. Similarly to the previous calculation, the probability of this event is at most 

fn\f p(n-p) \ )p{Kl -20) = ( c i oc 2 n ( <n-p)c \^\ P _ (1 _ o(1)W i ogK1 _ ,- w '-°M 
\p) \p( Kl -20)) K 11 I s V(«i-20W ) 



e 



Altogether we deduce that ^3 occurs with probability at most 0(n 9 ) with room to spare. 

Let U C V(G^) be the set of all vertices whose degree in is at least k + 3. It is important 
to note that U as well as the properties addressed by the events ^1,^2,^3 are entirely determined 
by the edge set E(G) = {e±, . . . , e m } regardless of the order in which they appeared (as opposed to 
Bk,Ck which were a function of G and H). Condition on this edge set E(G), unordered, and further 
condition on the event A c = A^Ci A^Ci A%, recalling that F(A C ) = 1 — 0(n~ 9 ) by the above analysis. 

Under this conditioning, the edge set E(H) is obtained as a uniform subset of m! of these edges, 
hence the variable 

Jjj = {uv G E{G) \ E(H) : u,v G U} 

is hypergeometric. Specifically, Jjj is the result of m — ml = nj log logn samples without repetition 
with the target population having proportion at least 1/50, since A\ guarantees at least m/50 internal 
edges within any set of |n vertices whereas A\ implies that \U\ > |n. Hoeffding's inequality for 
hypergeometric variables [15] now tells us that the probability that for some absolute constant 03 > 0, 



P(Ju <(m- m')/75 | E(G) , A c ) < e " a 3("') 



e - n i-°(i) 



where the last inequality holds for large enough n. 

Now let W\ C U denote the subset of vertices of U which have degree at most k\ in G and 
are incident to at least 4 edges of E{G) \ E{H). The number of edges in the sample E{G) \ E{H) 
that are incident to a fixed set of s such vertices is stochastically dominated by a binomial variable 
Bin(m — m' , SKx/m'), as there are overall at most ski edges incident to this set in G. Hence, 

W(\Wi\ > p I E(G) , A c ) < P (Bin (m — m , pni/m) >2p) < 

< ( en ploglogn \ 2p _ / e (l) \ 2p _ -n 1 -^ 1 ) 
~ V2ploglogn (10 - o(l))n) V20 ° [ ') 6 

where the first inequality in the second line used the hypothesis m! ~ m ~ cn/2 > lOn. Conditioned 
on this event, our assumption on the maximal degree of G implies that the number of edges incident 
to W\ in G is at most K2P = o(n/ logn). Similarly, letting W 2 C U be the set of vertices whose degree 
in G exceeds «i, we know by assumption that < P an d that the number of edges incident to this 
set in G is then at most K2P = o{nj 'logn). 
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Letting U* = U \ (W\ U W2), by definition we have that every vertex of U* is incident to at most 
3 edges of E(G) \ E(H). At the same time, the degree in of every v & U is at least k + 3, 
and consequently each v G U* has at least k neighbors among V(G**>) in H, hence U* C B k . The 
above estimates show that, except with probability exp(— n 1-0 ^ 1 )), there are at least n/(751oglogn) 
edges with both endpoints in U (counted by the variable J\j above) whereas the total number of edges 
incident to W\ U W2 = U \ U* is o(n/ log n) = o{n/ log log n). Therefore, 

P (J < n/(100 log log n) | £7(G) , A c ) < e~ nl ~ o(1) , 

and the desired (unconditional) estimate on J now follows from the fact that P(^ c ) = 1 - 0(n~ 9 ). ■ 

We can now turn to Lemma 12.31 taking the target graph G' to be for F = (V, S*). As 
mentioned above, it is known that whenever the fc-core for k > 15 is nonempty, its size is at least 
in except with probability exponentially small in n. Recalling that V(F^) = V(G^) and that 
E(G)\E(F) contains J < n/ log log n internal edges of B^, the requirements of the lemma are satisfied. 
This lemma will be used in tandem with the following well-known corollary of the classical rotation- 
extension technique of Posa [29] (see, e.g., |2H Corollary 2.10], which is formulated slightly differently 
though the exact same proof implies both statements), where here and in what follows the length of 
paths/cycles will refer to the number of edges in them: 

Lemma 2.6. Let r be a positive integer, and let G = (V, E) be a graph where for every subset R C V 
with \R\ < r we have \Nq(R) \ R\ > 2|i?|. Then for any path P in G, denoting its length by h, 

(i) there is path P' of length h + 1 in G containing all vertices of P plus a new endpoint u ^ P, or 

(ii) there is a cycle C' in G of length h + 1 on the same vertex set as P , or 

(Hi) there are at least r 2 /2 non-edges in G such that if any of them is turned into an edge, then the 
new graph contains a cycle C' as above, i.e., an (h + l)-cycle on the same vertex set as P. 

Lemma 12.31 enables us to apply the rotation-extension technique to G' with r = n/c 2 . Let P be 
the path in G' such that V{G') \ P can be covered by I < n/\/Iogn cycles as per the conclusions of 
Lemma HT3l Let h denote the length of P (i.e., P has h edges) and let {C\, . . . , Cg} denote the cycles 
covering V(G') \ P. We now argue that either G' is Hamiltonian or there exist at least r 2 /2 edges - 
to be referred to as boosters — the addition of each of which would create either a Hamilton cycle or 
a path P' containing all vertices of P in addition to one of the Cj's. Thereafter, we will sprinkle the 
edges . . . ,eij to repeatedly absorb all £ cycles into P and then form a Hamilton cycle. 

To justify the above claim, examine the three possible conclusions of Lemma 12.61 above: 

(i) If the path P can be extended to a path P' ending at some vertex u E V(G') \ P, delete an edge 
to unravel the cycle Cj containing u into a path, and concatenate that path to P'. 

(ii) If there is a cycle C' on the same vertex set as P, either C' is a Hamilton cycle in G' as required, 
or the connectivity of G' would imply that C' is connected by a path to some u £ V(G') \ P. In 
the latter case, unravel both C' and the cycle Ci of u in the obvious way into a single path. 

(iii) Otherwise, there are at least r 2 /2 edges, the addition of each of which would lead to Case (jn]). 

Overall, letting G' = G' and G'j be the result of adding the edge to G'j_ 1 for j = 1, . . . , J, 
we observe that the above mentioned properties of G' are satisfied for each j (being closed under the 
addition of edges) and so encountering I + 1 < n/^/\ogn + 1 rounds for which is a booster in G'j_ 1 
would guarantee Hamiltonicity. 
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The proof can now be concluded from arguing that, for any j = 1, . . . , J, the probability that 
ei- is a booster in G'j_ 1 is uniformly bounded from below. Indeed, e% } is uniformly distributed over 
all missing edges in the induced subgraph of G'j_ 1 on Bk- Of course, V(G'j_ 1 ) = V(G^) = BkUCk 
so there are o(re 2 ) edges incident to Ck in G'j_ 1 . It follows that almost all of the boosters of G'j_ 1 
are edges whose both endpoints lie in B^, and the probability that belongs to this set is at least 
(1 — o(l))(r 2 /2)/('^ fe !) > (r/n) 2 > 1/c 4 . Hence, conditioned on the event J > n/(1001oglogn), 
the number of boosters we collect (formally defining every new edge as a booster upon achieving 
Hamiltonicity) stochastically dominates the random variable Y ~ Bin(re/(100 log log n), 1/c 4 ), which 
in turn satisfies ¥(Y > c'n/ log log re) > 1 — exp(— re 1-0 ^ 1 )) for some d > (that depends on c). In 
particular, Y > n/\flogn + 1 with probability 1 — exp(— re 1-0 ^ 1 )) and the proof is complete. ■ 

Proof of Theorem 12.21 The proof will follow the same line of argument used to prove Theorem l2.ll 
albeit in a somewhat simpler setting as we can utilize the Hamilton cycle in as the foundation 
of our 2-factor, and furthermore, we can afford to select the subset Bk of vertices — the bulk of the 
fc-core to be used for sprinkling — based on the information of H (in lieu of conditioning on future 
information from the random graph process). Namely, we will choose Bj. to be the vertex set of H^ k \ 
Recall that a key element in the proof of Theorem 12.11 revolved around controlling the random variable 
J despite its delicate dependence on Bk- As mentioned there, this issue would be circumvented should 
Bk be determined already by graph H, precisely our present situation. 

Let H ~ G{n, ret) for m > \(7k — l)n. Ignoring floors and ceilings for brevity here and throughout 
this proof, let ei, ... j e n /g be a uniformly chosen subset of the edges missing from E(H), and let G be 
the result of adding these edges to H. 

Suppose that is Hamiltonian (otherwise there is nothing to prove), let Bk = V(H^), and 
observe that thanks to Lemma 12.41 we already know that \Bk\ > 0.999re with probability 1 — o(re~ 3 ). 
To exploit the other conclusions of that lemma, we must first recover the graph G' as defined there. 

As before, we will condition on the graph G' as well as on the random variable J counting the 
number of edges with both endpoints in Bk, without revealing the identity of those edges. As we 
mentioned, the fact that Bk is determined by H will readily yield the sought lower bound on J . 
Formally, for each t = 1, . . . , re/8, expose whether the new edge et = uv (uniformly distributed out of 
all missing ones) has either u ^ Bk or v £ Bk, and if so, expose its endpoints u, v themselves. This 
process reveals the graph G 1 and random variable J, yet we see that each edge is uniformly distributed 
out of all missing ones (as we no longer have a conditioning involving the graph process at future times 
as in the proof of Theorem 12. ip . At all times we have at most m < n log n + re/8 edges in our graph 
(as per the scope of Theorem 12. 2p . therefore the probability that et has both its endpoints in Bk is at 
least ((°- 9 ^ 9n ^ _ 2relogre)/(2) > 0.998, with the last inequality being valid for large enough re. That 
is to say, the random variable J counting the number of such edges that we encounter) stochastically 
dominates a binomial variable Y ~ Bin(re/8, 0.998), whence standard concentration arguments imply 
that J > re/10 except with probability exponentially small in re. Finally, by the definition of the 
process above, each edge et that was accounted for in J is a uniform edge among all those missing 
from the induced subgraph on Bk at the end of the previous iteration. 

Applying Lemma [2.41 to G' = (V',E'), we find that with probability 1 — o(re~ 3 ), every subset X in 
V' of size \X\ < re/5 has at least 2\X\ external neighbors in G' . We claim that it is moreover connected 
with the same probability: Indeed, the only edges in E(G^) \ E{G') are edges in E(G) \ E(H) with 
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both endpoints in Bk, yet by assumption Bj~ is connected already in the subgraph H C G since 
is guaranteed to contain a Hamilton cycle. Therefore, the fact that is connected carries to G ' . 

We are now in a position to conclude the proof in the same manner used to prove Theorem 12.11 
Considering all points in V \ Bk as trivial cycles (of which there are strictly less than O.OOln since 
\Bk\ > 0.999n) and adding those to the simple cycle that goes through all vertices of Bk in we 
arrive at a 2-factor in G' with at most O.OOln + 1 cycles. By the expansion properties of G that were 
detailed above, we can derive from Lemma 12.61 that there are at least n 2 /50 boosters in G' . However, 
all but O.OOln 2 such boosters have both of their endpoints in Bk, therefore the probability that a new 
edge, uniformly chosen out of all edges missing from the induced subgraph on Bk, is a booster, is at 
least 1/50 (with room to spare). Seeing as G' is connected, sprinkling any such edge to G would 
increase the length of the longest path in G', and encountering O.OOln + 1 boosters would culminate 
in a Hamilton cycle through the vertices of V . Iterating this procedure for J > n/10 rounds, the 
number of boosters encountered before Hamiltonicity is achieved stochastically dominates a binomial 
random variable Z ~ Bin(n/10, 1/50). Since Z concentrates around its mean of 0.002n we deduce 
that a Hamilton cycle will be formed except with probability exponentially small in n, as desired. ■ 



3 Expansion and factors in cores 



This section contains the proofs of Lemmas 12.31 and 12.41 which address properties of the random 
graph near and beyond the A:-core threshold. These two regimes are studied in Sections 13.11 and I3.2| 
respectively, yet we begin with a straightforward fact on the expansion of small sets, to be used in 
both regimes. 



Claim 3.1. Fix k > 15, let G ~ G(n,p) for p = X/n with 1 < A < log 2 n. With probability 1 



on 



the following holds for any subgraph G 
have \Nq/{X) \ X\> 2>\X\ for every 
every X C V of size \X\ < |A~ 5 / 3 n. 



(V',E') C G (not necessarily induced) with 5(G) > k. We 
have \N G ,(X)\X\ > 3\X\ for every X C V of size \X\ < f A" 15 / 7 n, whereas \N&(X)\X\ > 2\X\ for 



Proof. Consider some X C V of size at most On that does not expand to at least b\X\ external 
neighbors in G for some fixed b > 2, and let Y C V denote the external neighbors of X in G . The 
fact that 5(G) > k implies that out of the ('^') + |X||K| potential edges incident to X, at least A;|X|/2 
edges exist in G 1 , hence also in G. Whenever b < k/2 — 1, the probability of this event is thus at most 



< 



< 



E 

x<8n 
|251ognJ 



f + bx 2 
kx/2 



kx/2 



< 



E 



E [»■ 



-fc/2+b+l+o(l) 



x=l 



x<9n 

I9n\ 

+ E 

x= [25 log n 



en (en \ 1 



bx J 



(2b + l)ex 
k 



n 



(e/6) 



6+1 



b b 



■((2b + l)eX6/k) 



(3.1) 



Taking 6 = 3 and 9 = |A _15//7 , the first summation is o(n~ 3 ), whereas for k = 15 the occurrences 
of A in the second summation cancel (and for A; > 15 the exponent of A will be negative), thus reducing 
that summation into at most 



L|A- 15 /7„j 

E 

x= [25 log n] 



(W3) 4 
27 



3e\ 2 
~k) 



L|A- 15 /7 n j 

s E « 

x= [25 log n\ 



•a/3 



o(n 



-8\ 



12 



where we used the fact that (7e / 7 3)4 (3e/fc) fc / 2 < § for any k > 15. 

Similarly, taking 6 = 2 and B = |A _5//3 , the first summation in (|3.ip remains o(n -3 ) and again we 
can omit all occurrences of A in the second summation for k > 15, reducing it into at most 

L|A" 5 /3 n j 



L|A- 5 /3 n j 

'E 

x= [25 log n] 



(5e) 3 
256 



fcn a; 

2 



£ E « 

x= [25 log n] 



-s/8 



o{n- A ) , 



as required. 



3.1 Properties of sparse random graphs near the core threshold 

Throughout this subsection we will restrict our attention to Q(n,p) in the regime p = 0(l/n). The 
next claim establishes that in any induced subgraph of minimum degree k — 1, sets that are large 
enough — namely, ones whose size is at least \fn — have many external neighbors. This can be 
established up to a linear scale, e.g., for sets up to size n/15, solely based on an assumption that the 
minimum degree of the graph under consideration (a subgraph of our random graph) is at least k. 
However, as we will later see, it will be imperative to obtain this estimate for larger sets, to which end 
we will rely on additional properties that are available to us in the framework of Lemma [ 



Claim 3.2. Fix k > 15 and let G ~ Q(n,c/n) for some fixed k < c < The following holds with 
probability at least 1 — O(0. 8^). If G' = (V',E') is an induced subgraph of G on \V'\ > |n vertices 
and H = (V',E' \ (E\ U E2)) where E\ is a matching and ji^l < n/ log log n such that the minimum 
degree of H is 5(H) > k — 1, then every X C V of size y/n < \X\ < In satisfies \Nh(X) \ X\ > n 1//3 . 

Proof. First consider the range yfn < \X\ < Bin for 6»i = Jj. Exactly as in the proof of Claim 13. 1\ the 
probability that there exists a subgraph H of G with minimum degree k — 1 (not necessarily formed 
by deleting the edge sets E\ and E2 as above) within which we can find a set X C V of size x such 
that y/n < x < Bin and all of its external neighbors belong to some Y C V of size y = n 1 / 3 is at most 



E 

\/n<x<.9\n 



en ( en \ vl x ( e(x + 2y) ^ ( c 



x V y J V k — 1 / \n 



fc-i _ fc -i 
2 



(3.2) 



where in comparison with (|3.ip here we let b assume the role of y/x. Examining the base of the 
exponent in each summand we see that (en/y) y / x < exp(n _1 / 6+ °( 1 )) = l + o(l) and similarly the other 
appearance of y are asymptotically negligible. Since k < c < |fc < |(fc — 1) the mentioned expression 
is at most 



k-l 

en ( 5ex\ 2 , .„ . „ , . fc-i 



(1 + o(l))— — < (e/B 1 + o(l)) (5e0i/2)— < 0.8 , 
x \ An J 

where the first inequality used the fact that x < B\n and the second one holds for any k > 15. The 
occurrence of a bad set X in this range of x therefore has an overall probability of 0(0.8^). 

Now take Bi = y. To treat subsets X of size B\n < x < &2n we will take advantage of the additional 
hypothesis about the edge set of H. For a given choice of V, X, Y such that all external neighbors of X 
are confined to Y (whose size we recall is y = n 1//3 ), there can be at most \X\ + \E2\ < |X| +n/ log logn 
edges between X and V'\(XL)Y) in G. The variable T counting the number of such edges has a law of 
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Bin(x(n' — x — y),p), and in particular its mean is x(n' — x — y)p > x(|n — |n — o(n))p > (c/2)x > 7x 
for large n, since c > k > 15. Thus, writing 5 n = 12/ log log n so that x + n/ log log n < (1 + <5n)£ ; 

P(r < (1 + 5 n )x) < (1 + S n )x( X ^f ~ X ~ y A p (l+6n)x e -px(n'-x-y)+p(l+5 n )x 

V (i + d n )z / 
< (1 + o(l))ec(l - x/n)e- c ( n '- a ')/ n j * < [(1 + o(l))ce 1 - c / 2 

where the last inequality plugged in the facts n' > |n and x < |n. Armed with this expression we 
revisit (|3.2j) and revise it to incorporate the above probability, albeit at the cost of accounting the ( 
possible choices for V', so the probability of encountering the mentioned set X becomes at most 



E 



E 



6in<x<e 2 n ± n < n '<n 



en 



ecx 



x V (k — l)n 



fc-i 

2 ce 1 -' 2 



s E 

9in<x<92n 



(I + o(l))e 2+h ^) 



hi n 



ecx 



x V (A; — l)n 



fe-i 

2 ce- c / 2 



where we used the fact that ^2 i<an (™) < exp(/i(a)n) with h(x) = — xlogx — (1 — x) log(l — x) being 
the entropy function, and as argued before, the terms involving y in (|3.2p are easily absorbed in the 
(1 + o(l))-factor. Writing c = c/(k — 1) and x = x/n (so that c < | and 6*i < x < #2) we can rearrange 
the base of this exponent in each summand and find that it is asymptotically equal to 



xe fc - 3 Vs 



ha)/x\ — 



ce 1 - 5 ^) 2 <e 2 (xe^'^ k ~ 23 



k— 1 

5\ ~ 



using the facts that A; > 15, that c* 31 < (§(fe— l))^ 31 < 5/3 for A: > 15 and that the function t h-> te _i 
with i > has a global maximum at i = 1. Similarly, the function 1 1— > te a l l with a, t > is increasing 
for t > a and asx>^ = ^>^/i(|) this implies that the above expression is maximized at x = |, 
which case it evaluates into 



m 



21 



< 0.9 



for any k > 15, and so the probability of encountering the mentioned set X in this range is 0(exp(— an)) 
for some absolute constant a > 0, which completes the proof. ■ 



In what follows, for a graph H and a subset A of its vertices let c Q {H \ A) denote the number of 
components of odd size in the subgraph obtained by deleting the vertices of A from H. The above 
claims are already enough to imply a bound on c Q (H \ A) for small subsets A, which will later used 
as part of the proof of Lemma [ 



Corollary 3.3. Fix k > 15 and let G ~ Q(n,c/n) for some fixed c such that max{/c,20} < c < |A;. 
Then with probability 1 — o(n -3 ), for any induced subgraph G' = (V',E ! ) C G of minimum degree 
at least k on \V'\ > |n vertices and any H = (V',E f \ (E\ U E2)) such that E\ is a matching, 
\E 2 \ < nj log log n and 8(H) > k-1, we have c (H\X) < \X\ for all X C V of size \X\ < ^c^ /7 n. 
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Proof. Let = § c~ 15 / 7 , and assume that the events stated in Claims [3711 and ET21 hold. In particular, 
the graph H under consideration satisfies that every set S C V of size at most #n has at least 
3\S\ — \S\ = 2\S\ external neighbors (deleting the matching E\ can destroy at most |S| external 
neighbors), and in addition it has at least n 1 / 3 external neighbors if its size lies between yfn and |n. 

Suppose first that logn < |X| < (0/2)n and let {C\, . . . , C s } be all the connected components in 
H \ X of size \d\ < On. Clearly (N H (d) \ Q) C X, but at the same time \N H (d) \ d\ > 2\d\ 
hence we must have |Cj| < |X|/2 for all i. Suppose that D is a minimal union of Cj's such that 
\D\ > |A|-±log n. Since \X\ > logn we have \X\ < %\D\. Using that \C{\ < \X\/2, by the minimality 
of D, we also have \D\ < \X\ - |logn + \X\/2 < 3|X|/2 < On. Therefore \N H (D) \ D\ > 2\D\. 
However, at the same time \N H (D) \ D\ < \X\ < as there are no edges between distinct C{,Cj, 
contradiction. Consequently, there cannot be more than \X\ — ^logn components of size at most 
On (in fact, we showed that the union of all such components has cardinality at most \X\ — |logn). 
When added to at most 1/0 components of H \X whose size exceeds On we readily reach the required 
inequality c Q (H \ X) < \X\ in this case. 

Next, consider the case (0/2)n < \X\ < On. Let us repeat the above argument, this time only 
collecting components of size at most logn. Taking D to be a union of such components so that 
l\X\ - logn < \D\ < f|Jf| we see that \Nh(D) \ D\ < \X\ < + | logn, which for n large enough 
contradicts the expansion assumption \Nh(D) \ D\ > 2\D\. We deduce that there are at most ^\X\ 
such components, and adding at most n/logn = o(|X|) components of size larger than logn gives 
c (H\X) < \X\. 

If 3 < \X\ < logn we observe that by our assumption there are no connected components of H\X 
of sizes between -y/n and |n. Consider the components C\, . . . ,C S whose sizes are at most yjri. Since 
there can be at most 3 components of size larger than |n, it follows that if | U« Cj| < \X\ — 3 then 
accounting for all such components results in c (H \ X) < \X\, as required. Otherwise, there exists a 
minimal union D of Cj's such that \D\ > \X\ — 2. Being well in the range of the expansion hypothesis 
(this union has size at most y^logn), the fact that \X\ > \Nh{D) \ D\ > 2\D\ implies that \X\ < 4. 
However, in this case necessarily there are no small components in H \ X, since otherwise each d 
would have to be size at most |X|/2 by the expansion property, thus a vertex v £ Ci would have 
degree at most \d\ - 1 + |X| < - 1 < 5 (recall that 5(H) >k—l> 14). 

Finally, if < \X\ < 2 then no component of H \ X can have size less than |n (recall that 
such components had to satisfy |Cj| < | J\T|/2 and are thus empty by the discussion in the previous 
paragraph). At the same time, the probability that two subsets A and B of size |n each have no edges 
between them in G except for a matching (corresponding to E\) is readily bounded by the probability 
that the degree into B of every vertex in A is at most 1, translating to an upper bound of 

2 2 

(1 - p) 2 ^ ^ < [el h ^2 2 (jc+l- o(l)) e-y c ) ^ < (0.99) 2 "/ 7 , 

where the last inequality holds for any c > 20. One can now repeat this calculation, this time taking 
into account the edges of E2. Since their number (n/ log logn) is negligible in comparison with the 
sizes of A and B, they contribute a factor of at most ( ^ n ^V ) v n l log log 11 = in the above estimate 

' J \n/ log log n) r 

and hence the conclusion remains valid. This concludes the proof. ■ 
A prerequisite for treating sets of larger size, which will also be useful to separately bound the size 



n 



7?n 
|n 



-np + (1 — p) 
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of the 2-factor in G once we establish its existence, is the next estimate on the maximal number of 
vertex-disjoint cycles in G. 



Claim 3.4. Fix c > and let G ~ Q(n,c/n). Let Y count the maximal number of vertex- disjoint 
nontrivial cycles in G. Then F(Y > ^re/ydog n) < e _v/ ™ for any sufficiently large n. 

Proof. Let Yq be the maximal number of vertex-disjoint cycles in G such that each cycle is of length 
at most L = 4y / logn. Since Y < Y + n/L it suffices to show that P(y > n/L) < e~^. Observe that 

EY < £ ( H ) (s - 1)! (-) S < £ c 3 = = n°« < n/(2L) , 

s<L 71 s<L 

and consider the vertex-exposure martingale for Yq in G(n, c/n). Its Lipschitz constant is clearly equal 
to 1 since the addition of a new vertex to a graph retains every existing subset of disjoint cycles 
whereas it can increase the cycle count by at most 1. Hence, Hoeffding's inequality gives 

P (Y > n/L) < P (Y - EY > n/(2L)) < e" ^ 1 ^ < e'^ , 

where the last inequality is valid for large enough n. ■ 

The final ingredient needed for proving Lemma 12.31 is the following claim, which verifies Tutte's 
condition for large sets and complements the range of sets that were covered by Corollary 13.31 

Claim 3.5. Fix k > 15, let G ~ G{n, c/n) for k < c < and let G' = (V, E') be an induced subgraph 
of G with 5(G') > k. Let H = (V',E' \ (Ei U E2)) where E\ is a matching and l^l < n/ log log n. 
Then with probability \-0{e~ n ' 40 ') we have c {H\X) < \X\ for all X C V such that \X\ > fc" 15 / 7 



n. 



Proof. Set x = \X\ and consider first subsets of size 



3n n , 



Observe that by Claim 13.41 with probability at least 1 — 0(e~v n ) the maximal number of vertex 
disjoint cycles in G, and hence also in H, is at most n/\/log n. In particular, there can be at most 
that many connected components of H \ X containing cycles. Disregard these components, as well 
as at most n/ log log n additional components incident to the edge set E^. Altogether we discarded 
o(n) components, hence if c Q {H \ X) > \X\ for a linear X as above then we will be left with at least 
j|x odd components in H \ X for large enough n. Let Y be a set comprised of an arbitrary leaf from 
each of these components (which are trees by construction). The vertices of Y have degree at least 
k in G' and as such they have at least k — 2 neighbors in X in the graph H (plus a single neighbor 
in their tree component of H \ X and possibly one additional neighbor in E\ ) . The probability that 
there exists such a pair (X, Y) in G is at most 



n / C / \ / \ „ _ 15 „ n / c n n . . 31 



^ U i hBin(x,c/n)>fc-2)] ie < ^ 

3 _ 15 ^ / ^16 / „ _ 15 

-jc ~7~ n x=jc T n 



^ (^) i0 P(Bin(x, c/n) > k 



Ki 
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Plugging in the well-known estimate (see 0115]) that if Z is a binomial variable and a > then 
P (Z - EZ > a) < exp (-0(a/EZ)EZ) for 0(x) = (1 + x) log(l + x) - x, we get that 

1! ( J ec i5/rA 15 P(Bin(X) c/n) > fc _ 2) < 16 ( 7 (7jt/3)15/7 \ * (Jfc _ 2) _ (fc _ 2)efe _3 

£2e6l5 (^2)"- 2 ' n ( 1 + ^2) 5 (^2)" 7 < - 95 - 

where the first inequality uses the fact that c < and xc/n < 1, while the last inequality holds for 
k > 15. Combined with the previous inequality this implies that the probability of encountering some 
X violating Tutte's condition in H and with size as in (|3.3|) is at most exp(— c'n) for some fixed d > 0. 

We will now handle the range 

n . n , 

- <x<(k-3)-. 3.4 
c c 

Given a candidate subset X of size x for which c {H \ X) > x we again consider the above defined set 
Y (comprised of an arbitrary leaf from each tree component of H \ X not incident to any of the edges 
of E2). Previously, we only used the fact that each v € Y has at least k — 2 neighbors in X. Now we 
will use the fact that one can take Y to have size (1 — o(l))x, and in addition Y is an independent set 
in H (its vertices fall in distinct components of H \ X). The probability that a given subset Y of size 
y in G spans precisely t edges is at most 

(§!,«(!_„)©-.< l(M>L\' (1 - p p =:Ct , 

which is maximized at t = (|) > (1 — o(l))y/2, with the last inequality due to x > n/c. Of course, 
the values of t we are considering cannot exceed y/2 as the edges of G in Y are only allowed to form 
a matching (recall that Y is not incident to E2), thus by Stirling's formula 

E 6 < (1 + ,) i±S (ilk ^ _ < ((e + 0(1)M <J-°«>* . 

Combining this estimate with the number of choices for the sets X, Y and with the probability that 
each vertex of Y has at least k — 2 neighbors in X, we arrive at the following upper bound on the 
event of encountering X, Y as above: 

(fc— 3)n/c 

E 



x=n/c 



(l + o(l)) (^J~P(Bin(x, c/n) > k - 2) y/epxe-P* 



(3.5) 



where within each summand we could safely absorb all the o(l)-terms in the exponents into the 
o(l)-term of the leading constant since the base of each of these is uniformly bounded. 

Notice that for £ > k — 2 we have 

P(Bin(x,p) = I + 1) x — £ p ^ ^ _)_ (1)) P x 



P(Bin(x,p) =£) £ + ll-p~ K w/ fc-l' 

which by our hypothesis on x is at most (1 + o(l))(/c — 3)/(k — 1) < 1 for large enough n. It follows 
that 

l+o(ll 1 + o(l) (r)r) k ~ 2 

P(Bin(x, c/n) > k - 2) < -J-^W{Bin(x, c/n) = k — 2) < -— Wjl Lo^ ' 

1 fc-i 1 fc-i ^ K z >- 
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and plugging this in f)3.5j) gives an upper bound of 

\fc-2 



> + *»(?) rrV^-v^^ 



x=n/c 



Writing = px and c = c/fc, we will denote the (asymptotic) base of the exponent above by 

#• / \ ~i — -z-4-- t-— - /c 2 (/c — 1) 
f k (z) = c*e - (fc _ 1 _ z)(fc _ 2)! - 

With this notation we aim to bound fk(z) away from 1 for all 1 < z < k — 3 and 1 < c < X. 
It is easy to verify that the local extrema of fk are precisely the roots z\ < z£* of the quadratic 
3z 2 + (12 — 5k)z + (2/c 2 — 9A; + 7), where z\ is a local maximum of fk and z£* is a local minimum. 
Furthermore, since 

zT = -k - 2 + -y^ 2 - 12/c + 60 > k - 3 
6 6 

(the last inequality being valid for any k > 0) we can conclude that throughout the range 1 < z < k — 3 
the function /fc(z) is maximized at 

4 = -jfe -2- -\/A; 2 - 12/c + 60. 
6 6 



Observe that z^ +1 > z\ and 



2 4 2 
-fc- - < zl < -k- 1. 

3 3 ~ k ~ 3 



In particular < |fc — | and < — ^ < 1. 

Using the bound z* k < 1 in the expression l/(k— 1— 2;) we immediately get that f k (z k ) < gk(z^) 



where 



5( fc (-z) = 3c 2 e ^+2^ 2 



3 , 1 5 



.fc-X*(fc-l) 



(*-2)! ' 

For ease of notation, let z = z^ and z\ = z^ +l . We have 

9k(zo) \z J {k-lf 



Observe that 



and therefore 



Sfc+i^i) < ^ k + l < §(fc-^) fc + l < 2 + . _ 



£?fc(2 ) ~ (/c-l) 2 - jfe-1 A: — 1 — 3 V k-lj 6 
where the last inequality is valid for any k > 15. It therefore suffices to show that gk{z k ) is bounded 
away from 1 for k = 15. Indeed, for = 15 we have 

*J 5 € (10-!,10-§) 

and substituting c by its maximal value of | gives 
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This establishes the desired statement for the range (|3.4p provided that 1 < c/k < |, that is, 
c {H \ X) < \X\ for all sets X whose sizes are between n/c and (k — 3)n/c except with probabil- 
ity exponentially small in n. 

To conclude the proof it remains to handle x > (k — 3)n/c. Observe that if c < 2(k — 3) then the 
range (|3.4p goes up to x < n/2, beyond which c Q (H \ X) < \X\ trivially holds. We are thus left with 
the case 2(k — 3) < c < |& and (k — 2>)n/c < x < n/2, where we claim that w.h.p. there will not exist 
a set Y as above which is an independent set in H. Indeed, the probability that Q(n, c/n) will contain 
a set Y of size (1 — o(l))(k — 3)n/c < y < n/2 where the edges form a matching (recall that Y is not 
incident with E2) of size £ < y/2 is at most 



e<y/2 



£<y/2 



„ u n (ey\U/v (2£c 



ec + o(l) (eycy/y ik _ 3)/2 \ y 
k-3 V U ) 



-py/2 



It is easy to verify that the function x i— > (ax) l l x is decreasing in (e/a, 00) for any a > 0. In particular, 
for a = ec/4 > e/2 the maximum of {ay/Iffy over all y/£ > 2 is at most \/2a; hence each of the 
summands in the right-hand-side above is at most 



ec + o(l) [ec p -( k -3)/2 
k- 3 



< 



c 3 / 2 e 3 + o(l) 
V^CA; - 3) 



-fc/2 



< 




'<G) 









where the second inequality uses that c < Zk and that (I) 3 ^ 2 - 



-3; V2(fc-3) 

We are now in a position to prove the main result of this subsection. 



< 5 for k > 15. 



Proof of Lemma 12.31 The fact that G' is connected (as stated in Item (j]} of the lemma) follows 
from Corollary 13.31 for a choice of X = 0. Similarly, the expansion of sets of size at most n/c 3 (as 
stated in Item (jn|) of the lemma) is an immediate consequence of Claim 13.11 (with plenty of room to 
spare, as |c 5//3 < c 2 already for c > 2). It remains to establish the conclusion on the existence of a 
factor consisting of a path P and at most n/yTogn cycles in G' . 

The combination of Corollary 13.31 and Claim 13.51 shows that existence of a near-perfect matching 
(one that misses at most one vertex) in G' with probability at least 1 — o(n~ 3 ). Indeed, in the 
case where |V'| is even, the existence of a perfect matching in G' follows from the celebrated Tutte 
condition [32] since c (^H \ yl) 5^ |^4| for every A CZ V'. When |V| is odd, satisfying Tutte's condition 
for every ^4^0 will imply a near-perfect matching via Berge's formula (see, e.g., \22\ §3.1.14]) since, as 
discussed above, G' is connected with probability 1 — o(n~ 3 ). Denoting this matching by M\ we then 
re-apply Corollary 13.31 and Claim [331 on H = (V',E' \ Mi) to extract another near-perfect matching 
M2 with the same success probability. The factor formed by M\ U M2 is either a union of cycles 
(possibly with one isolated vertex which we count as a trivial cycle) or a union of cycles in addition 
to a single path. Moreover, the number of cycles is at most n/^/log n except with exponentially small 
probability thanks to Claim [3~H as required. ■ 
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3.2 Properties of sparse random graphs beyond the core threshold 

The main element needed for the proof of Lemma [231 is the expansion properties of sets in our random 
graph whose sizes exceed the one addressed in Claim 1370 

Claim 3.6. Let G ~ Q(n,p) for p = X/n with 32 < A < \fn. There exists an absolute constant a > 
such that with probability at least 1 — 0(e~ an ) we have \Nq(X) \ X\> \\X\ for every X C V(G) such 
that iA~ 5 / 3 n < \X\ < n/5. 

Proof. Let 0\ = 9\(n) = ^A~ 5 / 3 and 02 = 1/5. Consider a potential subset X of x vertices such that 
9\n < x < 02n and yet \Na(X) \ X\ < b\X\ for b = §. Letting x = x/n for brevity, the probability 
that there exists such a set X is at most 

/ n \({b + l)x\ (n _ {b+1)x) ( en \ {b+1)x Hl ±_ Kb+1)x ± x(n _ (b+1)x) 

\(b + l)x)\ x J [ P) ~ \(b+l)x) 



\ 6+1 x I 

The unique extremum of f(t) = e xt /t is a minimum at t = 1/A, thus for 9\ < x < 62 we have 
f(x) < max{/(^i), f(02)}. One can verify that 

m) = 2A 5 / 3 e A " 2/3 / 2 < 5e A / 5 = f{9 2 ) 

for any A > 32, and in particular the above probability is at most 

7 7 

10 e l+/ l (2/7)-(2/7)A+A/5A ' < ( Hgl+log 2-\/12\ 2 * < 2 ~2x 



7 J ~ V 7 

where in the last inequality is again valid for A > 32. Summing over 0(n) values of 6\n < x < 02n 
now gives the desired result. ■ 

Observe that the claim above analyzed expansion in the entire random graph, while we will be 
mostly interested in the expansion properties of sets within the fc-core. To this end we provide the 
following estimate on the size of core. 

Claim 3.7. Fix k > 15, let G ~ Q(n,p) for p = X/n with (7k — l)/3 < A < yfn, and define 

7 = 7 (fc,A) = ( 1 + S)e k (^j^j e-^- 5)x for 5 = 1Q- 3 . (3.6) 

Then 7 < 10 -3 and \V(G^)\ > (1 — 7)71 with probability at least 1 — 0(e~ an ) for some absolute 
constant a > 0. 

Proof. Consider a potential set S of s = [7"<J vertices that do not belong to the fc-core. We note that 
7 is decreasing in A in the range A > k and so 7 (&, A) < 7 (fc, Ao) for Ao(A;) = (7k — l)/3 = — 1) + 2. 
At this value of A we have 

7 (fc, Ao) = (1 + ^e 1 -^ (f (l + ^^y) fNij , (3.7) 



20 



and as the last factor is less than 1 for any k > 4, the entire expression is decreasing in k and so it is 
upper bounded by its value at k = 15, which, as one can easily verify, is strictly less than 10 -3 = 5. 

The iterative construction of the /c-core via successively deleting vertices of degree less than k 
reveals that necessarily there are at most (k — l)s edges in the cut between S and V(G^'). As the 
expected number of edges in this cut is s(n — s)p > sA/2 > s(k — 1), the probability to encounter such 
a set S is at most 



(«(fc-l) + l) 



) + 1) Q *)y(fc-l) (1 _ p) «(n- 8 )-.(fc-l) 



< 



se p(n-s) 



e(n — s)p 
fc-1 



fc-i 



e + o(l) 



7 e 



eA 



A(l- 7 ) V _ 1 



fc-1 



l + o(l) 



(1 + <5) e( <5 -^) A 



< e 



for some absolute constant a > 0, where the second equality substituted the definition of 7 in (I3.6P 
and the last inequality justified by the fact that 7 < 5. ■ 

Proof of Lemma 12.41 Let H ~ Q(n,m) for ^(7k — l)n < m < nlogn. Note that the edge density 
in H is p ~ A/n for (7fc — l)/3 < A < 2-^^p, and in particular for k > 15 we see that A satisfies the 
hypotheses of Claims 13.61 and 13.71 As per the conclusions of these claims, every set X C V of size 
±A~ 5 / 3 n < \X\ < n/5 has \N H (X) \ X\> ||A|, and \V{H^)\ > (1 - 7)n for 7 < 0.001 as defined 
in (|3.6p . except with probability 0(exp(— an)) for some absolute constant a > 0. Letting G be the 
result of adding n/8 uniformly chosen new edges to i?, its edge density is asymptotically (A + \)/n, 

every subgraph F C G with 



on 



and appealing to Claim 13.11 we find that with probability 1 
minimum degree k has \Np(X) \ X\ > 2\X\ for any X C V(F) of size at most |A _5//3 n. Condition on 
these properties of H and G, from which we will now derive the required expansion properties of G' . 

By definition, the graph G' has V' = V(G^) as its vertex set, and its minimum degree is at least 
k, since every vertex u G V'\V{H^) has the same degree in G' as it has in G^ (at least k), whereas 
every u G V(H^) has degree at least k already within C G' . Thus, thanks to our conditioning, 
every X C V such that \X\ < |A" 5 / 3 n satisfies \N G ,(X) \X\> 2\X\. 

It remains to treat sets X C V of size |A _5//3 n < |A| < n/5. We claim that the proof will be 
concluded once we show that 



7(M)<V 5 / 3 
5 



for any A > (7fc — l)/3. 



(3.8) 



To see this, recall by our assumptions, for any set X in this size range, its set of external neighbors 
Y = N H (X)\X satisfies \Y\ > ||A|. Out of this set Y, all but at most 7n vertices belong to V 
(and hence belong to Nc(X) \ X, as the induced subgraph of H on V is a subgraph of G'). Thanks 
to (13. 8p we have 7n < |A|/2, thus in particular \Nqi(X) \ X\ > 2\X\, as desired. 

Turning our attention to (|3.8p and recalling the definition of 7 from (|3.6p . we need to show that 



(1 + 5) 



eA 



fc-i 



,l-(l-S)X . ° A 5/3 K 1 



for 5 = 10" 



Since the function x 1— > x k 1+5 / 3 exp(— (1 — 5)x) has a unique extremum in the form of a global 
(fc — 1 + §)/(! — S) < I k — 1, it suffices to establish the above inequality at 



maximum at x 
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Ao = |fc — | = |(/c — 1) + 2. Following (|3.7p . the left-hand-side of the sought inequality then becomes 

(1 + 4) . 5 ■ 2 V3 (, + l (k _ W (I (, + _^_) ^Mtf)" . 

Denoting this expression by we need to show that < 1 for all A; > 15. We see that for any k >2, 

V 7* -»/ \3 A 7(t-l)A (fc-l)(7*-8) 

Clearly each of the three factors that depend on k is decreasing in k (with the last of these being 
strictly less than 1 and raised to a power growing with k). In particular, for any k > 5 we have 

s/ < §• < 0.95 < 1 . 



6c- i £4 

Equivalently, is strictly decreasing in A; in the range k > 5, and so for any A; > 15 we have 

£ fc < £15 < 0.95 < 1 , 

thus establishing (|3.8|) . as required. 



4 Packing edge-disjoint Hamilton cycles 

In this section we prove Theorem [2j Since several of the ideas and techniques used here are rather 
similar to those applied beforehand, we will allow ourselves to be somewhat brief at times. Also, we 
will borrow extensively from the notation and terminology of the previous sections. 

First of all, let 



Similarly to Theorems 12.11 and 12.21 the following two theorems cover the critical regime (which is 
narrower here) and the supercritical regime, respectively. 

Theorem 4.1. Let k be a large enough constant, and let G ~ Q(n,m) for m = cn/2, with c satisfying 
k < c < k + 100-v/A; log k. Then with probability 1 — o(n~ ~ 2 ) either is empty or it contains a family 
of k\ edge- disjoint Hamilton cycles. 

Theorem 4.2. Let k be a large enough constant, and let H ~ Q(n,m) for m = cn/2, with c = c(n) 
satisfying k + 99\/k log k < c < 21ogn. Let G be the graph obtained by adding n uniformly distributed 
new edges to H . Then the probability that H^ k ' contains a family of k\ edge disjoint Hamilton cycles, 
and at the same time G^ does not contain a family of k\ edge-disjoint Hamilton cycles, is o(n~ 2 ). 

The above two theorems, combined with the classical result of Bollobas and Frieze that the 
random graph G ~ Qin, m) with m = ^ (log n + (k — 1) log log n + uj n ) and any diverging increasing 
sequence uo n w.h.p. contains \k/2\ edge-disjoint Hamilton cycles, clearly establishes Theorem [5J 

In our proofs we prefer to work with the binomial random graph model G(n,p) instead of Q(n, m); 
standard arguments about the essential equivalence of the models can be invoked to transfer our claims 
to the model Q(n,m). 

We start with the following lemma, guaranteeing typical local expansion in subgraphs of minimum 
degree at least 5 in sparse random graphs. 
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Lemma 4.3. Let G ~ Q(n,p) for p = c/n with cq < c(n) < 21ogn. If cq is large enough, then with 
probability 1 — o(n~ 2 ) every subgraph H of G of minimum degree 5(H) > 5 has the following property: 
every subset X C V{H) of size \X\ < n/c 10 satisfies \N H (X) \X\> 2\X\. 

Proof. First observe that, for any given constant Co > 0, with probability 1 — o(n~ 2 ) every set 
X C V(G) with \X\ < Co vertices contains at most \X\ + 2 edges. Indeed, the probability that there 
exists a subset violating this claim is at most 

E ( H ) (^K)p 1+3 = 0(n c °p c »+ 3 ) = o(n~ 2 ) . 
i<C W V + 3 / 

Now we argue that typically in G{n,p) all larger sets are still fairly sparse. Specifically, we claim that 
there exists a large enough Co such that with probability 1 — o(n -2 ) every set Z C V(G) of cardinality 
Co < \Z\ < 3n/c 10 spans at most 1.2\Z\ edges in G. Indeed, the probability that there exists a set Z 
of cardinality \Z\ = z violating this claim is at most 

2=Co 2 = Co Z = C() 

log n 3n/c 10 
< £„-0.i* + [l0c~°- 8 ] z = O (n- 2 ), 

z=Co 2=log n 

for c > Co large enough and z > Co large enough. 

Now take Co < \X\ < n/c 10 and let Y = Nh(X)\X . Since all degrees in H are at least 5, we deduce 
that e H {X,Y) > 5\X\-2e H (X), and thus e H (XUY) = e H (X) + e H (X, Y) > 5\X\-e H (X) > 3.8\X\. 
If \Y\ < 2\X\, then \XUY\ < 3\X\ < 3n/c 10 and e H (XUY) > 3.8\X\ > 1.2\XUY\ — a contradiction. 

For the sets X of constant size \X\ = x < Cq, if \Njj(X) \ X\ < 2\X\, then there is a set Y of 
cardinality \Y\ = 2x — 1 with Nh(X) \ X C Y. Since all degrees in H are at least 5, and X spans at 
most x + 2 edges, it follows that en(X U Y) > 5x — e(X) > 4x — 2. The probability that there exists 
a pair X,Y with \X\ = x in Q(n,p) is bounded from above by 




for x > 4. For simple union bound shows that the probability of the existence of a pair 

X, Y as above is at most n~ x ' = o(n 2 ) and for x = 1, 2 the sought claim follows from the 

minimum degree assumption. ■ 

We will also need that typically in the supercritical regime the A;-core contains a proportion of 
vertices rapidly approaching 1. 

Lemma 4.4. Let G ~ Q(n,p) for p = c/n with k + 99a/ fc log fc < c(n) < 21ogn. 7/ k is large enough, 
then with probability 1 — o(n~ 2 ) the k-core satisfies \G^\ > n(l -c" 200 ). 

Of course, there is nothing new in this lemma, and the estimate on the size of the A;-core can easily 
be improved. We present its (straightforward) proof here mainly for the sake of completeness. 
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Proof. Let 

t = n/c 200 . 

Construct the (possibly empty) fc-core by peeling off repeatedly vertices of degree less than k. If this 
process lasts for t steps, then G contains a subset Vo of |Vq| = t vertices such that all degrees from Vo 
to V — Vq are less than k. For a given Vq and a given v 6 Vo, the degree of v to V — Vo in Q(n,p) is 
Bin(n — t,p), and by Chernoff's inequality (see, e.g., [2]), 

( ((n-t)p- k) 2 \ ( ((n-t)p-kf 

-t,p) <k) < exp - IF ' < exp ' 



< 




2(n — t)p J \ 2np 



-199 



2c 

One can verify easily that for c > k + 99\/klogk and large k it holds that 

( c _ c -199_ fc ) 2 

^ ^> 3001ogc, 

and thus p < c -300 . Hence the probability of the existence of a set Vq as above is at most 



(:y< (t)v <(«-"»)■ 



for large enough c (or large enough k, as c > k). If no such set Vo exists, then clearly the fc-core of G 
has at least n — t vertices, as required. ■ 

For the critical regime, the analog of Lemma 12.31 is now the following statement. 

Lemma 4.5. Fix k large enough and let G ~ G(n,m) with m = cn/2 and k < c < k + WO^k log k. 
Then with probability 1 — o(n~ 2 ), for any G' = (V',E ! ) C G with minimum degree 5(G') > k such that 
V' = V{G^) andE{G^)\E' contains at most n/ log log n edges, the graph G' contains a (2ki) -factor. 

The proof of this lemma can be derived by repeating the arguments of Pralat, Verstraete and 
Wormald [30] and of Chan and Molloy |10| . They proved that for large enough k, upon creation, the 
A:-core with high probability either contains a (k — 2)-factor or is (k — 2)-factor-critical [30], or 
even contains a {k — l)-factor or is {k — l)-factor-critical [10]. (A graph G is t-factor-critical if for every 
v £ V(G), the subgraph G — v contains a t- factor.) Both these papers use Tutte's factor theorem to 
derive the likely existence of a required factor. As one can anticipate (and we indeed verified), altering 
the /c-core slightly (by deleting at most n/ log log n edges) while preserving minimum degree k, do not 
influence the proof. We would also like to point out that the results in both papers jl0|. l30j used the 



assumption that c <k + 2\Jk log k, but one can easily check that replacing 2 with any other constant 
will still work. Finally note that 2k± < k — 2 and either k — 2 or k — 1 is even. Thus, Lemma [4. 5 1 follows 
from the above results by the well-known fact that a 2s-factor contains a 2t-factor for all t < s. 

Proof of Theorem 14.11 (sketch) . We utilize the same approach, notation and terminology as in the 
proof of Theorem 12. li We start by summarizing all the results that have been established in the proof 
of that theorem which we use here. 
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Let S = (ex, . . . , e m ) be a uniformly chosen ordered subset of m edges out of the possible ones 
on the vertex set V = [n], m' = m — n/ log log n, and let G and H be the graphs on the vertex set 
V with edge sets E(G) = {ex, . . . , e m }, E(H) = {ex, . . . , e m /}. As G ~ C/(n, m), if Cr^ is empty then 
there is nothing left to prove. We therefore assume otherwise and aim to establish that contains 
a family of k± edge-disjoint Hamilton cycles with probability 1 — o(n~ 2 ). Given the above two graphs 
H C G on the same vertex set V, define the partition of the vertex set of G^ k \ the fc-core of G, 
into Bk = Bk(G,H) and Ck = Ck(G,H) so that Bk consists of all vertices of G^ having at least k 
neighbors in V(G^) already in the subgraph H and Ck = V(G^) \ Bk- Denote the indices of the 
subset of the final m — m! edges having both endpoints in Bk by J and let J = Recall that we 
have already established that upon conditioning on B^, Ck (such that Bp. UC^ ^ by our assumption) 
as well as J and all the edges S* = {ej : j ^ J~}, the remaining J edges of S are uniformly distributed 
over all edges missing from Bk (that is, edges with both endpoints in Bk that did not appear among 
ex,... ,e m /). Moreover we also proved that J > n/(1001oglogn) with probability 1 — 0(n~ 9 ). 

Letting F = (V,S*), as in the proof of Theorem [2TT1 we have V(F ( - k ^) = V(G^). Further note 
that G' = satisfies the conditions of Lemma 14.51 and therefore contains a (2/cx)-factor. By a 
well-known fact from graph theory, this (2&i)-factor can be decomposed into k\ edge-disjoint 2-factors 
Ax,... , A/ Cl . Observe that, due to Claim [374"1 with probability 1 — o(n~ 2 ) each 2-factor Aj has at most 
n/ Vlogn cycles. We have at our disposal a set J of J > n/(100 log log n) random edges to fall into the 
set Bk- Split J into k\ sets Ji,..., Jk x of nearly equal sizes \ J{\ > [J/k\\ and use J7i to convert the 
i-th 2-factor Aj into a Hamilton cycle, of course relying on the existing edges as the backbone. More 
specifically, we repeat k\ iterations of the following process. Suppose we have already constructed 
Hamilton cycles H\, . . . , Hj_x- Our goal is to convert A, into a Hamilton cycle Hi using the edges of 

Gi = G' - (Hi U . . . U Fj_x) U (A i+1 U . . . U A fcl ) 

and the random edges from Ji- Observe that the minimum degree of the relevant graph satisfies 
5(Gi) > k — (2ki — 2) > 5, thus making Lemma 14.31 applicable. By this lemma, all connected 
components of Gi are of size at least n/c 10 and therefore G{ has at most c 10 connected components. 
Moreover, since \Ck\ < 2 (to — to') = 2n/ (log log ra) , at least n/c 10 — \Ck\ > n/c 11 vertices of each 
component belong to Bk- Thus, adding any random edge in Bk will connect two of these components 
with probability at least c -22 . By standard concentration arguments, it is then easy to see that 
throwing in, say, log 2 n edges from Ji will turn G{ into a connected graph with probability 1 — o(n~ 2 ). 

Now we apply Lemma [2. 61 Due to this lemma together with the expansion property guaranteed by 
Lemma [4.31 at any step Gi contains at least n/(2c 20 ) boosters. Moreover, the number of such boosters 
with both endpoints in Bk is at least n/(2c 20 ) — n\Ck\ > n/c 21 . Thus, the probability that a random 
edge falling into Bk hits a booster is at least c~ 21 . Since Aj has only at most n/ylogn cycles to begin 
with, hitting n/y/log n+1 such boosters will transform it into a Hamilton cycle. Also note that we have 
not yet used the vast majority of the edges from Ji, and can still add \Ji\ — log 2 n > n/(2001oglogn) 
random edges to Gi- By adding all these edges, we collect a number of boosters which stochastically 
dominates a binomial random variable with parameters n/(2001oglogn) and c~ 21 . Therefore, by 
standard concentration arguments, we create the next Hamilton cycle Hi with probability at least 
1 — o(n~ 2 ). Finally, we release the edges of Aj not used in Hi back into the graph, guaranteeing that 
in the next iteration too the minimum degree of the backbone graph is at least 5. ■ 

Proof of Theorem 14.21 Observe first that due to Lemma 14.41 the difference in sizes of the fc-cores 
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(j( fc ) and H^ is relatively small: 

\V(G {k) )\ - \V(H^)\ <n- \V(HW)\ < n/c 200 . 

Let V = V(H^) and let R be the set of n random edges. For every edge e = (u,v) 6 R (uniformly 
distributed over missing edges in H) expose whether both u, v are in V' or not. If at least one of 
them is not in V expose both endpoints of e. Otherwise put e into R' and note that R' is the set of 
random edges uniformly distributed over all missing edges in the induced graph of H on V . Also note 
that since |V'| > (1 — c~ 200 )n, the number of edges in R' is stochastically dominated by the binomial 
random variable with parameters n and 2/3. Therefore with probability at least 1 — o(n~ 2 ) in the 
end of this process R' has at least n/2 edges. Moreover, this process reveals the set of vertices of the 
fc-core of the graph G obtained from H by adding edges in R. 

Fix a collection K.\, . . . , of edge-disjoint Hamilton cycles in H^ k \ We will use the set R' of 
random edges to convert them into edge-disjoint Hamilton cycles in G^ k \ To this end, split the set 
R' in k\ parts R\, . . . , R^ of sizes \Ri\ > n/{2>k\). We perform k\ iterations of the following process. 
Suppose that for i £ [k\] we have already created Hamilton cycles Hi, . . . , H^i in G^ k \ Let 

G t = G« - {H x U . . . U Hi-i) U (A i+1 U . . . U A fcl ) . 

By definition, the minimum degree of Gi is at least 5, enabling us to utilize Lemma 14.31 Due to this 
lemma, all connected components of Gi are of size at least n/c 10 and therefore Gi has at most c 10 
connected components. Moreover, at least n/c 10 — n/c 200 > n/c 20 vertices of each component belong 
to V . Thus adding any random edge in V will connect two of these components with probability at 
least c -40 . Therefore, by standard concentration arguments, it is easily seen that throwing in (say) 
first log 2 n edges from Ri will turn Gi into a connected graph with probability 1 — o(n~ 2 ). 

Next, as in the proof of Theorem 12.21 we add to Aj all vertices of F(G") — V(H^) (at most 
n/c 200 of them) to have a 2-factor with at most n/c 200 + 1 cycles. We merge these cycles into one 
gradually, using the remaining |-Rj| — log 2 n > n/(4k\) > n/c 2 of the edges of the Ri. By the expansion 
properties (Lemma l4.3|) of Gi, we can derive from Lemma f2.6l that there are at least n 2 /(2c 20 ) boosters 
in Gi. Moreover, all but n 2 /c 200 such boosters have both endpoints in V . Hence, the probability that 
a new edge, uniform over all missing edges in V' , is a booster is at least c -21 . Hitting n/c 200 + 2 such 
boosters will transform our 2-factor into a Hamilton cycle in Gi. Since the number of boosters that 
we encounter is stochastically dominated by a binomial variable Bin(n/c 2 , c -21 ), we deduce that with 
probability 1 — o(n~ 2 ) exposing random edges from Ri will create the next Hamilton cycle Hi in G". 
Finally, we release the edges of A, not used in Hi back into the graph, guaranteeing that in the next 
iteration too the minimum degree of the backbone graph would be at least 5. ■ 

5 Open Problems 

The main problem that remains open is, of course, whether the result of Theorem[T] (the Hamiltonicity 
of G\ for all t > w.h.p. provided k > 15) can be pushed all the way down to k > 3. The bottleneck 
in our proof, as mentioned before, was finding a 2-factor in the A:-core minus the unrevealed sprinkling 
edges. Due to the complexity of this graph, instead of appealing to Tutte's criterion for a 2-factor, we 
construct one using 2 disjoint perfect matchings. This appears to be possible at as long as k > 7. 
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By taking k > 15 we were able both to simplify the arguments used to recover this 2-factor, as well as 

(k) 

to guarantee that G t would remain Hamiltonian for all t > w.h.p. It is thus quite plausible that, 
when paired with a more refined analysis, the present framework should produce an improved bound 
on k. However, a new idea seems to be needed to take it down to k = 3. 

For the packing problem, while we are only one Hamilton cycle short of the conjectured packing of 
[(k — l)/2j edge-disjoint Hamilton cycles in the /c-core, the present methods appear to be incapable of 
bridging this small gap. In particular, in our cycle-by-cycle extraction argument it is crucial to have 
minimum degree 5 at every stage, so as to guarantee sufficient expansion of small sets (as given by 
Lemma |4, 31 and used by Lemma |2 . 6[) . Once we extract [(k — 3)/2j Hamilton cycles from the k-core, 
we will be left with a subgraph of minimum degree 3, and this by itself is not enough to ensure the 
required expansion (e.g., consider a set of degree-3 vertices forming a cycle). In conclusion, while new 
ideas are needed to fully settle this conjecture, our result provides a fairly firm evidence for its validity. 
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